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The *=30-kb coronavirus (+)RNA genome is replicated and tran¬ 
scribed by a membrane-bound replicase complex made up of 16 
viral nonstructural proteins (nsp) with multiple enzymatic activi¬ 
ties. The complex includes an RNA endonuclease, NendoU, that is 
conserved among nidoviruses but no other RNA virus, making it a 
genetic marker of this virus order. NendoU (nsp15) is a Mn 2+ - 
dependent, uridylate-specific enzyme, which leaves 2'-3'-cyclic 
phosphates 5' to the cleaved bond. Neither biochemical nor se¬ 
quence homology criteria allow a classification of nsp15 into 
existing endonuclease families. Here, we report the crystal struc¬ 
ture of the severe acute respiratory syndrome coronavirus nspl 5 at 
2.6-A resolution. Nspl 5 exhibits a unique fold and assembles into 
a toric hexamer with six potentially active, peripheric catalytic 
sites. The structure and the spatial arrangement of the catalytic 
residues into an RNase A-like active site define a separate endo¬ 
nuclease family, endoU, and represent another spectacular exam¬ 
ple of convergent evolution toward an enzymatic function that is 
critically involved in the coronavirus replication cycle. 

endonuclease | severe acute respiratory syndrome | nidovirus | replication 

C oronaviruses possess the largest and most complex genome 
among RNA viruses [**30 kb of single-stranded (+)RNA] (1). 
The family Coronaviridcie (genera Coronavirus and Toroviriis) be¬ 
longs to the order Nidovirales, which also includes the Arteriviridae 
and Roniviridae. Translation of the coronavirus genome results in 
two replicase polyproteins (of *4,500 and **7,500 aa) that are 
encoded by ORFs la and lb and are known as polyproteins ppla 
and pplab, respectively (2). Expression of the pplab C terminus 
depends on a translation frameshift occurring just upstream of the 
ORFla stop codon (3). Because of this translational regulation of 
gene expression, the five C-terminal proteolytic products, nonstruc¬ 
tural protein (nsp)12 to nspl6, are produced at significantly lower 
amounts than are the ORFla-encoded replicase subunits. Proteo¬ 
lytic processing of coronavirus ppla and pplab by ORFla-encoded 
proteases yields 15-16 nsp, which mediate the replication of the 
viral RNA genome and the synthesis of multiple subgenome-length 
mRNAs (4) that encode the structural and accessory proteins (2). 

Recently, coronaviruses received both general attention and 
renewed interest after a worldwide outbreak of the severe acute 
respiratory syndrome (SARS) in 2003. The etiological agent of 
SARS was identified as a group II coronavirus, which was named 
SARS-CoV (5-7). The genome contains **29.7 kb and codes for 16 
nsps, 5 structural proteins, and 7 accessory proteins (2). In the last 
two years, several of the SARS-CoV replicase subunits including a 
number of enzymatic activities have been characterized (reviewed 
in refs. 1 and 2). Furthermore, crystal structures of five nsps (or 
subdomains thereof) have been reported: the ADP-ribose-l''- 
phosphatase macro domain of nsp3, the main protease nsp5, the 
RNA-binding protein nsp9, and the complex of nsp7-nsp8, a 
proposed processivity factor for the RNA-dependent RNA poly¬ 


merase nspl2 (8-12). nspl3 is a superfamily 1 helicase (7, 13), 
whereas nspl4, nspl5, and nspl6 have been predicted or confirmed 
to harbor 3'-5' exonuclease, endonuclease, and 2'-0-ribose meth- 
yltransferase activities, respectively (7, 13-15). Among these, the 
nidovirus uridylate-specific endoribonudease (NendoU) residing 
in nspl5 is considered a major genetic marker of nidoviruses 
because of its nidovirus-wide conservation but complete absence in 
other RNA viruses (16). The biochemical properties of nspl5 have 
been characterized to some extent, and the protein has been shown 
to be critical for coronavirus RNA synthesis (15-17). NendoU is 
distantly related with the Xenopus laevis endoribonudease, Xen- 
doU, that is involved in the processing of intron-encoded box C/D 
U16 small, nucleolar RNA (18). 

Catalytic features, such as metal-ion dependence, are often used 
to classify RNases into two main groups. The first group employs 
a two-metal-ion mechanism, with one ion activating the substrate 
and the second one stabilizing the transition state (19). In the 
second group, two side chains, typically those of His residues, act as 
a general base and general acid in a coordinated fashion to promote 
cleavage (20, 21). However, neither nspl5 nor XendoU can be 
easily assigned to one of these enzyme classes: in common with 
enzymes from the first class, NendoU and its homologs require 
metal ions for activity. However, unlike these enzymes, NendoU 
and XendoU generate 2'—3' cyclic phosphates upon cleavage, which 
is a typical feature of RNases belonging to the second class (18). In 
previous studies, potential catalytic amino acid residues of NendoU 
have been identified by site-directed mutagenesis (16, 17), and the 
molecule has been reported to form monomers, trimers, and 
hexamers in solution (17). Here, we report the crystal structure of 
the SARS-CoV NendoU (nspl5) at 2.6-A resolution. The folds of 
each of the three nsp 15 subdomains and their spatial arrangement 
were found to be unprecedented. We also show that nspl5 assem¬ 
bles into hexamers presenting six independent catalytic sites at their 
periphery. The combined data define a novel family, endoU, of 
RNA endonucleases. 

Results 

The Monomer Structure. The nspl5 crystals we obtained contained 
one monomer per asymmetric unit. The experimental electron 
density, obtained from single-wavelength anomalous dispersion 
phasing to 2.9 A on a selenomethionine-substituted crystal, was of 
sufficient quality to trace the whole chain, including residues 5 and 
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Fig. 1. Structure of nsp15, the endoribonuclease from SARS-CoV. (a and b) Cartoon representation (a) and topology diagram (b) of nsp15 monomer. The 
structure consists of three domains: N-terminal domain (a1-a2), central domain (a3-/38), and C-terminal domain (a6-/314). Secondary structures are colored as 
follows: red, a-helices; yellow, /3-sheets; and green, loops, (c) A view of the nspl 5 hexamer. In the trimer shown in the foreground, the subdomains of one of 
the monomers are colored as follows: green, N-terminal domain; yellow, central domain; and red, C-terminal domain. The other two molecules are shown in 
blue and magenta. The molecules belonging to the trimer in the background are shown in gray, (cf) 90° rotation of the nspl 5 hexamer as shown in c. (e) Surface 
representation of the nspl 5 hexamer. Solid arrows indicate the active sites of the trimer in the foreground; dashed arrows show the active sites of the trimer 
in the background, (f) The hexamer shown in e is rotated by 90°. In this orientation, two active sites are visible, and they are highlighted by solid arrows and 
colored in white. 



6, of the N-terminal affinity tag. The final model, refined at 2.6-A 
resolution, is of high quality as judged by the electron density map 
(Fig. 4, which is published as supporting information on the PNAS 
web site) and by PROCHECK criteria (Table 1, which is published 
as supporting information on the PNAS web site). The protein 
selected for large-scale production and crystallization contained 
three mutations: the last His of the N-terminal His 6 -tag was 
replaced with Gin, and two other residues, Val-291 and Asp-300, 
were replaced with Gly. The protein was confirmed to possess 
uridylate-specific endoribonuclease activity (Fig. 5, which is pub¬ 
lished as supporting information on the PNAS web site) and the 
temperature factors observed in the Gly-substituted regions were 
within average, indicating that the two replacements did not affect 
the nspl5 local structure. 

The monomer consists of three domains of increasing size (Fig. 
In), a small N-terminal domain (residues —1 to 62), a central 
domain (residues 63-191) and a large C-terminal domain (residues 
192-345). The connectivity and topology of the molecule are given 
in Fig. lb. The N-terminal domain (al-o:2) is formed by a three- 
stranded antiparallel /3-sheet with two helices packed against one of 
its sides. As many as 88 (of 128) residues of the central domain 
(a3-/38) are part of coiled regions. However, only a short stretch of 
amino acids, between positions 110 and 115, have temperature 
factors higher than average, probably because of the extensive 
solvent exposure of this particular segment. Helix a3 is connected 


by a 39-aa-long coil to an a//3 region, consisting in two helices and 
five /3-strands. A central three-stranded /3-sheet (/34, j85, and j88) is 
covered on one side by two helices (a4 and a5), and two short 
antiparallel /3-strands (/36 and /37) are inserted between strand /35 
and j38. The C-terminal domain (a6-/314) contains an a-helical 
core (a6-a9) flanked on both sides by two antiparallel three- 
stranded /3-sheets (/39-/311 and /312-/314). These two /3-sheets form 
a groove that exhibits a positive electrostatic potential (see below). 

A search on the DALI server (22) did not identify structures with 
significant similarity to the nspl5 monomer or its C-terminal 
domain. When the N-terminal and central domains together were 
used as a structural query, no hit was returned at all. Thus, both the 
overall fold of each domain and the domain organization of nspl5 
appear to be undocumented. 

nspl 5 Is a Hexamer. Examination of the crystal packing of nspl5 
suggested a hexameric structure (110 X 80 A), formed by a dimer 
of trimers located around the cubic threefold axis. This observation 
is consistent with previously reported transmission electron micros¬ 
copy data (17) as well as gel filtration experiments (data not shown) 
(23). Each trimer is ring-shaped, and the juxtaposition of the two 
trimers creates a narrow tunnel with a diameter of 15 A in the center 
of the hexamer (Fig. 1 c and e). The three subdomains of the 
monomer are shown in Fig. 1 (Fig. 1 c and cl). The N-terminal 
domain mediates most of the critical intermolecular interactions 
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Fig. 2. Sequence alignment of nspl 5 from SARS-CoV and other coronaviruses as well as XendoU from X. laevis. The numbering refers to the SARS-CoV nspl 5 
sequence (SARS). nspl 5 secondary structures are shown above the sequences. Several mutants of nspl 5 have been previously reported (16, 17). Enzymes are 
shown with substitutions of residues highlighted in the following colors: blue, substitutions were affected in both their enzymatic activity and oligomerization 
state; pink, substitutions revealed enzymes with wild-type properties; green, substitutions affected the activity but not the oligomerization state. The four amino 
acids highlighted by an asterisk are the ones mutated in the present study. Fully conserved residues are shown in red. MHV, mouse hepatitis virus strain 2; BCoV, 
bovine coronavirus (strain 98TXSF-110-ENT); PEDV, porcine epidemic diarrhea virus (strain CV777); HV229E, HCoV-229E; AIBV, avian infectious bronchitis virus 
(strain Beaudette). 


that appear to drive the formation of trimers, including interactions 
with the central and C-terminal domains of the adjacent monomer 
(Fig. 1 c-f). Moreover, the N-terminal domains are placed at the 
interface between the trimers (Fig. Id), where they form a platform 
that supports the formation of the hexamer: as many as 45 (of 51) 
amino acids of the N-terminal subdomain are involved in trimer- 
trimer interactions. Conversely, as shown in Fig. 1 c and d, the 
C-terminal domain (in red) is far away from the trimer-trimer 
interface. The C terminus is mostly solvent-exposed and protrudes 
out of the hexamer core, forming the petals of a flower (Fig. 1 c-f). 
Accordingly, the C-terminal domains are involved in intratrimer 
interactions and not in the interactions between the two trimers. 
Approximately 13% (2,200 A 2 ) of each monomer’s water-accessible 
surface area is involved in the interactions forming the trimer, half 
of this surface with each of the other two molecules. Each monomer 
interacts only with one molecule from the other trimer, with an 
interaction surface area of 803 A 2 , representing only 4.7% of the 
monomer’s water-accessible surface area. 

Mutational Studies. Previously, mutational studies (16, 17) have 
identified residues that are essential for nspl5 endonuclease activ¬ 
ity. Ala substitutions of Asp-272, Lys-289, His-234, or His-249 (Fig. 
2) completely abolished nucleolytic activity, whereas Ala substitu¬ 
tions of Glu-3 and Glu-210, respectively, reduced this activity. In 
human coronavirus 229E (FICoV-229E) nspl5, Ala mutations of 
Asp-6408 and Asp-6435 (corresponding to Asp-296 and Asp-323 in 
the SARS-CoV homolog) led to inactivation of nspl5. Asp-272 is 
located at the interface between the C-terminal and central do¬ 
mains, suggesting an important role for the stability of the nspl5 
monomer. Indeed, the nspl5 D272A mutant was reported to be 
unstable in solution (17). Glu-210 is located in the core of the 
C-terminal domain, where it is part of an H-bond network between 
several secondary structures. Glu-3, by contrast, is located on the 


protein surface. It is involved in trimer-trimer interactions by 
forming two Fl-bonds with one of the monomers from the other 
trimer. Consistent with this observation, Guarino et al. (17) re¬ 
ported that a significant proportion (52%) of the nspl5 E3A mutant 
is monomeric, supporting a critical role of this residue in oligomer¬ 
ization. Taken together, the structural and functional information 
suggests that the enzymatic defects seen in the D272A, E210A, and 
E3A mutants are due to indirect effects, most probably structural 
destabilization of the protein, rather than affecting directly catalysis 
and/or substrate binding. The results obtained for Asp-296 and 
Asp-323 substitutions are even less conclusive: D323A abolished 
the nspl5 activity in FICoV-229E but not in SARS-CoV, and 
D296A inactivated FICoV-229E nspl5, whereas the corresponding 
SARS-CoV D296A mutant could not be obtained in a soluble form 
(17). In our structure, both of these Asp residues lie in the core of 
the C-terminal domain and are involved in an extensive H-bond 
network, with Asp-296 forming five hydrogen bonds and Asp-323 
forming seven. This H-bond network is close to, but not directly 
located in, the putative active site. We therefore believe that the 
latter two Asp substitutions have structural effects on nspl5 that 
indirectly impair the activity of the protein. In this context, the 
recent study by Posthuma et al. (24) is of particular interest. It was 
shown that substitutions of the two presumed catalytic His residues 
in the equine arteritis virus NendoU homolog caused a small- 
plaque phenotype and dramatic losses in virus titers, but, surpris¬ 
ingly, the His substitutions had only moderate effects on viral RNA 
synthesis. By contrast, substitutions of the Asp residues equivalent 
to Asp-296 and Asp-323 in the SARS-CoV nspl5 blocked equine 
arteritis virus RNA synthesis completely, and no viable virus could 
be obtained. Taken together, the structural and functional data 
suggest that the catalytic activity of NendoU has important func¬ 
tions in the nidovirus replication cycle and that the structural 
integrity of nspl5 is absolutely essential for viral replication. 
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Fig. 3. Catalytic site and proposed enzymatic mechanism, (a) Stereoview of the active site groove formed by the nspl 5 C-terminal domain. The catalytic amino 
acids of bovine pancreatic RNase A (in cyan) are superimposed with the putative catalytic amino acids of nspl 5 (in magenta), (b) Proposed enzymatic mechanism 
as suggested by the SARS-CoV nspl5 crystal structure and mutational studies, in analogy with the mechanism established for bovine pancreatic RNase A. (c) 
Stereoview of uridine 3'-phosphate (U3'MP) modeled in the nspl 5 active site. The main hypothetical interactions between the catalytic amino acids and U3'MP 
are shown as dashed lines, (c/) Stereoview of the electrostatic potential of the nspl 5 active site groove. The modeled U3'MP molecule is shown in ball and stick 
representation. 


The nspl 5 Active Site and Uridyl Specificity. The tunnel traversing the 
hexamer is too narrow to accommodate an RNA substrate, a 
feature proposed for the nsp7-nsp8 complex (11). Moreover, 
Glu-44 side chains protrude into the tunnel, rendering its surface 
negatively charged, a feature that is not compatible with RNA 
binding. The active site, therefore, should be located elsewhere. 
Previous mutational studies have identified Asp-272, Lys-289, His- 
234, and His-249 as essential residues for nspl5 endonuclease 
activity (Fig. 2) (16,17). Based on these data as well as on sequence 
comparisons of nidovirus and cellular nucleases, a catalytic center 
involving Lys-289, His-234, and His-249 was proposed (17, 25). In 
our structure, the three putative catalytic residues, Lys-289, His- 
234, and His-249, are indeed gathered together in a positively 
charged groove in the C-terminal domain, far from any monomer- 
monomer interfaces (Fig. 1 e and /), implying that the nspl5 
hexamer contains six independent active sites. 

Interestingly, bovine pancreatic RNase A, for which the catalytic 
mechanism has been well characterized, also employs two His 
residues and one Lys residue in its active site (His-12, Lys-41, and 
His-119). In this case, His-12 and His-119 act as general base and 
general acid, respectively, whereas Lys-41 interacts with the phos¬ 
phate moiety to stabilize the pentacovalent intermediate (21). 
Superimpositions of the nspl5 and RNase A tertiary structures or 
the loops bearing the three essential His and Lys residues proved to 
be impossible. In contrast, however, the Lys-289, His-249, and 
His-234 residues of the SARS-CoV nspl5 could be superimposed 
very nicely onto the RNase A Lys-41^ His-12, and His-119 residues 
with a rms deviation for C“ of 0.3 A (Fig. 3a), indicating that, as 
reported for other catalytic machineries (e.g., that of nucleic acid 
polymerases), the relative positions of the key active-site residues 
are much better conserved than are the general or local protein 
folds. 

Besides the catalytic His-His-Lys triad, two other amino acids, 
Thr-45 and Phe-120, are known to be important for catalysis in 
RNase A and its closest relatives. In RNase A, Thr-45 is responsible 


for the pyrimidine specificity on the 3' side of the cleavage site (21). 
In RNase Tl, Phe-100 (corresponding to Phe-120 in RNase A) 
makes stacking interactions with the ribonucleotide base and its 
phosphate moiety, which likely contributes to the proper position¬ 
ing of the ribonucleotide in the active site (26). Superimposition of 
the RNase A catalytic triad with the nspl5 Lys-289, His-234, and 
His-249 residues revealed that the positions of the nspl5 Ser-293 
and Tyr-342 residues corresponded to those of the Thr-45 and 
Phe-120 residues of RNase A (Fig. 3a). Similar to the Phe-120 in 
RNase A, the aromatic ring of the nspl5 Tyr-342 might establish 
van der Waals interactions with the uracil moiety in the active site. 
Alternatively, Tyr-342 could merely narrow the groove, thereby 
positioning the substrate appropriately for the reaction (Fig. 3 c and 
d). To further assess the significance of these residues, we per¬ 
formed several substitutions in nspl5, focusing on a C-terminal 
conserved amino acid patch (T340A, Y342A, and P343A). As 
shown in Fig. 6, which is published as supporting information on the 
PNAS web site, Pro-343 and Tyr-342 proved to be critically involved 
in activity (see Discussion). 

Furthermore, the excellent superimposition of the SARS-CoV 
nspl5 Ser-293 and RNaseA Thr-45 residues (Fig. 3a) as well as the 
known role of Thr-45 in the pyrimidine specificity of RNase A 
prompted us to investigate the function of Ser-293. We substituted 
this residue and measured the nspl5 endonuclease activity of the 
mutant. As predicted, the activity was significantly decreased but 
not completely abolished (Fig. 6 A, lane 5, andB, lane 5). The final 
intermediate cleavage product and some of the intermediate cleav¬ 
age products corresponded well to those produced by the wild-type 
enzyme (Fig. 6 A) (16), arguing against major changes in the 
specificity of the mutant. Although compatible with a role of 
Ser-293 in substrate binding, the data suggest that yet other residues 
must contribute to the pronounced uridylate specificity of the 
enzyme. In other CoV nspl5s and XendoU, the position of the 
SARS-CoV nspl5 Ser-293 residue is occupied by a conserved Thr 
residue (Fig. 2), providing additional support for the hydroxyl 
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functionality playing a role in a conserved mechanism. This hy¬ 
pothesis receives strong support from mutagenesis data obtained 
for HCoV-229E nspl5 (16), which showed that the replacement of 
the conserved Thr residue (Ser-293 in SARS-CoV) resulted in 
significantly reduced ribonuclease activity. Another residue that 
could be involved in substrate recognition is the conserved Thr-340. 
Also in this case, a substitution by Ala caused a significant reduction 
in activity, but again, no apparent change in specificity was ob¬ 
served. Structure information on a binary complex will be re¬ 
quired to elucidate or even manipulate the enzyme’s specificity 
determinants. 

Discussion 

The unparalleled size of the coronavirus RNA genome and the 
special mechanisms used by these viruses and their closest nidovirus 
relatives to produce an extensive set of subgenomic RNAs is linked 
to a replicative machinery of unusual complexity that also includes 
a stunning variety of enzymatic activities unique among RNA 
viruses. The biological functions (or even structures) of these 
multiple activities are only slowly beginning to emerge (1, 4, 7, 27). 
Based on its homology with XendoU (18), nspl5 could be identified 
as a novel RNA virus endonuclease activity (7, 15, 16). The 
SARS-CoV nspl5 crystal structure that we present here at 2.6-A 
resolution reveals a unique fold that leads us to propose a previously 
undescribed family, endoU, of RNA endonucleases. To date, this 
family encompasses two distantly related groups of cellular (pro¬ 
totyped by XendoU) and viral (prototyped by the coronavirus 
nspl5) uridylate-specific endoribonucleases. Although endonucle¬ 
ases are ubiquitous enzymes that are involved in many aspects of 
RNA metabolism, they are extremely rare in the RNA virus world, 
with only very few exceptions. Thus, for example, orthomyxoviruses 
carry in their PB1 replicase subunit an endonuclease activity 
dedicated to cap-snatching; the pestivirus glycoprotein E ms exhibits 
an endonuclease activity involved in counteracting the host IFN 
response; and retrovirus reverse transcriptases employ RNase H 
activities to remove the RNA template during retroviral DNA 
synthesis. None of these activities are structurally or enzymatically 
related to the NendoU of nidoviruses. 

In general, viruses have evolved a number of strategies to protect 
their genomes from host nucleases. In (—)RNA viruses, the viral 
ssRNA is generally coated with nucleoprotein, protecting the viral 
RNA from cellular nuclease attack. (+)RNA viruses, on the other 
hand, counteract cellular responses triggered by dsRNA by induc¬ 
ing and employing specific membrane compartments that protects 
the viral RNA from cellular nucleases, at least up to the stage where 
the (+)RNA is being used as a messenger RNA that directs viral 
protein synthesis. Based on these considerations, it is safe to assume 
that nidoviruses have evolved a mechanism that regulates and 
tightly controls the NendoU activity so as to act specifically on 
as-yet-unknown substrates in a tightly controlled manner. Potential 
mechanisms involved in the control of the NendoU activity have not 
been identified, but it has been suggested that they involve the 
ribose-2'-0-methyltransferase (nspl6) activity conserved in coro¬ 
na-, toro- and roniviruses (16). This hypothesis is based mainly on 
the fact that 2'-0-methylated RNA is protected from nspl5- 
directed cleavage. Both the conservation of a multitude of enzy¬ 
matic activities in nidoviruses and the specific features of the 
nidovirus life cycle indicate that nidoviruses have evolved currently 
unknown pathways to ( i ) regulate the activity of their RNA- 
dependent RNA polymerase, (ii) modulate processes ancillary to 
replication, or (iii) counteract innate immunity-related mechanisms 
of the host cell. The available evidence suggests that NendoU may 
be part of (some of) these nidovirus-specific pathways. 

Nspl5 is shown here to have an unusual structure made up by six 
monomers bearing six independent catalytic sites that are assem¬ 
bled into a large complex of toric shape. The interaction surface 
area between the subunits ranges between 800 and 1,100 A 2 , which 
is within the range of values reported for, e.g., Ig domain protein 


complexes. This surface area, however, represents only between 
4.7% and 6.5% of the whole monomer surface, which already is in 
the twilight zone for assessing strong quaternary assembly (28). This 
result may explain why, depending on the technique used, mono¬ 
mers, trimers, and hexamers of nspl5 have been observed (17). 
Indeed, during crystallization, which requires high protein concen¬ 
tration, the equilibrium may be shifted toward the hexameric form. 
Surprisingly, the central hole is too narrow to accommodate an 
RNA molecule, and the six active sites are peripheral. The archi¬ 
tecture suggests that nspl5 might be tethered to other partners 
connecting this hexameric assembly to the replication complex or 
that nspl5 may affect cellular processes not directly related to viral 
RNA synthesis. 

The low-resolution hexameric structure reported previously for 
nspl5 (17) indicated that nspl5 might be a hexamer and provided 
first insights into its shape. It turned out to be impossible to fit our 
hexamer structure in the previously reported nspl5 contour, 
whereas the two individual trimers could be fitted readily (Fig. 7, 
which is published as supporting information on the PNAS web 
site). Nevertheless, there were significant inconsistencies between 
our x-ray structure and the EM data. First, the central hole in the 
EM shape is twice as large as the one in the crystal structure. 
Secondly, the distance between the fitted trimers in the EM shape 
is too large for direct contacts to be established. The staining 
techniques used by the authors of the EM study might have caused 
deformations. Furthermore, the 2D crystals were oriented in such 
a way that the trimer-trimer distance could not be estimated. In 
conclusion, we think that the gross features of the trimer shape, but 
not the assembly of the hexamer, are reasonably reflected by the 
EM study. 

Although the general fold of nspl5 does not resemble the 
structures of any other endoribonucleases, striking similarities have 
been observed between the geometries of the RNase A active site 
and what we propose to be the nspl5 active site. A uridine 
3'-monophosphate molecule modeled in the active site suggests that 
a reaction similar to that performed by RNase A is conceivable. The 
catalytic His-234 and His-249 are properly positioned to accept and 
donate protons, as shown in Fig. 3b. The proposed mechanism also 
is consistent with experimental data obtained for nspl5: DNA or 
2'-0-methylated RNA cannot be cleaved by nspl5 because, obvi¬ 
ously, there is no 2'-OH that could be deprotonated by His-249. 
Furthermore, the proposed catalytic mechanism leads to a product 
with a 2'-3' cyclic phosphate (16). However, in this model of 
nspl5-mediated catalysis, the role of the Mn 2+ ions, which are 
known to be essential for activity, remains unclear. Experimental 
evidence suggests that Mn 2+ is not essential for RNA binding to 
XendoU (25) and that the nspl5 oligomerization state is not 
dramatically changed by Mn 2+ ions (17). Tryptophan fluorescence 
data, however, revealed that Mn 2+ ions induce conformational 
changes in the protein (15), pointing to a structural role of these 
ions, for example to induce a switch to the catalytically active 
conformation. To date, our structural results do not confirm nor 
exclude this hypothesis and, clearly, a complex of nspl5 with Mn 2+ 
will be required to resolve this issue. Unfortunately, in our hands, 
nspl5 proved to be unstable in the presence of Mn 2+ ions. In this 
context, it is also worth noting that a Tyr residue (Tyr-342), which 
is absolutely conserved in the endoU family, is located in close 
proximity to the catalytically essential ribose 2'-OH group (Fig. 3 
c and cl). It is therefore tempting to speculate that Tyr-342 might 
coordinate a Mn 2+ ion and thereby play a key role in catalysis. 
Aromatic residues are known to coordinate cations and metal ions, 
such as Mg 2+ and Mn 2+ , through cation-n interactions (29). 
Although long known for their structural role, these interactions 
have recently been revisited regarding their catalytic potential (30). 
Thus, for example, the endonuclease EndN was shown to employ 
Tyr and His residues to coordinate a catalytic Zn 2+ (31). In this 
mechanism, an aromatic ring holds a metal ion, which closely 
interacts with a ligand, such as an hydroxyl group, and the latter is 


11896 | www.pnas.org/cgi/doi/10.1073/pnas.0601708103 


Ricagno et al. 


then activated to perform its function as a nucleophile. Our 
structure suggests that such a configuration is possible and might 
involve the ribose 2'-OH group and the Tyr-342 and His-249 
residues, which are conserved throughout the endoU family (Fig. 
3). But clearly, structure analysis of RNA-nspl5 binary complexes 
is highly desirable to corroborate such a speculative scenario. 

Interestingly, although the active sites of RNase A and nspl5 
could not be superimposed, the relative spatial positions of the key 
catalytic amino acids of the enzymes did superimpose well, which 
might point to yet another case of convergent evolution of catalytic 
mechanisms. In terms of phylogeny, however, it seems most likely 
that the ancestor of the present-day nidoviruses acquired its Nen- 
doU domain from a cellular homolog (prototyped by XendoU). 
The conservation of such a specialized enzyme in the nidovirus 
replicase adds to the sophisticated beauty of these large viral 
replication machineries and suggests functional links between 
nidoviral enzymes and cellular metabolic pathways, which remain 
to be characterized in future studies. 

Materials and Methods 

Expression and Purification and Crystallization. Selenomethionine- 
labeled nspl5 was produced according to published protocols (32). 
The protein was purified and crystallized under the same conditions 
as the native enzyme, although it proved to be even less stable in 
solution than the native nspl5 (23). The crystals grew in the same 
fashion as those of the native protein and reached a comparable size 
after 1 week. Crystals belong to the same cubic space group as the 
native protein, P4 3 32, with unit cell parameters a = b = c — 166.6 
A, and 75% of the crystal volume is occupied by solvent (Vm = 5.0), 
with one molecule in the asymmetric unit (33). Attempts to obtain 
complexes of nspl5 with short RNAs have been unsuccessful, and 
addition of Mg 2+ ions caused the protein to precipitate. 

Data Collection, Structure Determination, and Refinement. Crystals 
of selenomethionine-labeled protein were cryoprotected with 33% 
glycerol, and data were collected at 110 K. nspl5 single-wavelength 
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anomalous dispersion data collection was carried out on the tunable 
beamline ID23 at the European Synchrotron Radiation Facility 
equipped with a MAR Marmosaic 225 charged-coupled device 
detector. Data were processed with MOFLM/SCALA (34, 35). 
Because the intensity of the diffraction was rather weak, we aimed 
to collect a high redundant data set to improve the quality of 
processed data. Phases were calculated at 4.0 A with the program 
SHELXD (36), which located all six selenium sites. Subsequently, 
phase refinement, density modifications, and phase extension to 2.9 
A were performed with RESOLVE (37), yielding a partial model 
with which «*40% of the main chain was traced. Manual model 
building was then carried out with COOT (38). Refinement of the 
model was done with REFMAC5 (39), using the maximum like¬ 
lihood residual, anisotropic scaling, and bulk refinement correction; 
hydrogen atoms were generated during cycles of refinement. The 
model was refined with native data extending up to 2.6 A. 

The analysis of the monomer and trimer interfaces was done with 
the protein-protein interaction server (www.biochem.ucl.ac.uk/ 
bsm/PP/server), and the analysis of the enzyme geometry was 
performed with COOT and PROCHECK (38,40). The topology of 
nspl5 was analyzed by using the protein topology homepage 
(www3.ebi.ac.uk/tops). The structures of nspl5 and RNase A were 
superimposed by using molecule superposition in CCP4 (34). 
Pictures and electrostatic potential surfaces were generated with 
PyMol (DeLano Scientific, South San Francisco, CA), and the 
topology diagram was produced with Topdraw (41). 
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